Search results (All records) for Creators/Authors contains: "Joachims, Thorsten"


  1. The growing number of college applications presents an annual challenge for college admissions in the United States. Admission offices have historically relied on standardized test scores to organize large applicant pools into viable subsets for review. However, this approach may be subject to bias in the test scores themselves and, given recent trends toward test-optional admission, to selection bias in who takes the tests. We explore a machine-learning-based approach that replaces the role of standardized tests in subset generation while taking into account a wide range of factors extracted from student applications to support a more holistic review. We evaluate the approach on data from an undergraduate admission office at a selective US institution (13,248 applications). We find that a prediction model trained on past admission data outperforms an SAT-based heuristic while matching the demographic composition of the last admitted class. We close by discussing the risks and opportunities of leveraging such a learned model to support human decision-making in college admissions. A rough sketch of the subset-generation idea follows this entry.
    Free, publicly-accessible full text available July 20, 2024
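The entry above describes the approach only at a high level. As a rough, hypothetical illustration of model-based subset generation, the sketch below trains a classifier on past admission decisions and compares its top-k subset against an SAT-cutoff subset. The features, model choice, subset size, and all data are assumptions made for illustration, not the authors' actual pipeline.

```python
# Minimal sketch of model-based subset generation for admissions review,
# contrasted with an SAT-cutoff heuristic. Feature names, model choice,
# and subset size are illustrative assumptions, not the paper's pipeline.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)

# Hypothetical past-cycle data: application features and admit labels.
n_past, n_new = 5000, 2000
X_past = rng.normal(size=(n_past, 6))  # e.g., GPA, essay score, activities, ...
y_past = (X_past @ rng.normal(size=6) + rng.normal(size=n_past) > 0).astype(int)
X_new = rng.normal(size=(n_new, 6))    # the new applicant pool
sat_new = rng.normal(size=n_new)       # standardized test scores of new pool

# Train on past admission decisions, then score the new applicant pool.
model = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
model.fit(X_past, y_past)
p_admit = model.predict_proba(X_new)[:, 1]

k = 500  # size of the subset forwarded for full human review

# Model-based subset: top-k applicants by predicted admit probability.
subset_model = np.argsort(-p_admit)[:k]

# SAT-based heuristic: top-k applicants by test score alone.
subset_sat = np.argsort(-sat_new)[:k]

print(f"overlap between the two subsets: {len(set(subset_model) & set(subset_sat))}/{k}")
```

The paper evaluates against real admissions data; the synthetic data here only exercises the mechanics of ranking by a learned score instead of a single test score.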
  2. Free, publicly-accessible full text available July 1, 2024
  3. Discussion of the “right to an explanation” has become increasingly relevant because of its potential utility for auditing automated decision systems, as well as for contesting such decisions. However, most existing work on explanations focuses on collaborative environments, where designers are motivated to implement good-faith explanations that reveal potential weaknesses of a decision system. This motivation may not hold in an auditing environment. Thus, we ask: to what extent could explanations be used maliciously to defend a decision system? In this paper, we demonstrate how a black-box explanation system built to defend a black-box decision system could manipulate decision recipients or auditors into accepting an intentionally discriminatory decision model. In a case-by-case scenario, where decision recipients are unable to share their cases and explanations, we find that most individual decision recipients could receive a verifiable justification even if the decision system is intentionally discriminatory. In a system-wide scenario, where every decision is shared, we find that while justifications frequently contradict each other, there is no intuitive threshold for determining whether these contradictions stem from malicious justifications or from the justifications' simplicity requirements conflicting with model behavior. We end with a discussion of how system-wide metrics may be more useful than explanation systems for evaluating overall decision fairness, while explanations could still be useful outside of fairness auditing. A toy sketch of the case-by-case scenario follows this entry.
    Free, publicly-accessible full text available June 12, 2024
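To make the case-by-case scenario concrete, here is a toy sketch (not the paper's explanation system) of how a justification can be fit to a single case: the underlying decision is intentionally discriminatory, yet each recipient receives a threshold rule on a non-protected feature that verifies their outcome. The decision rule, features, and data are invented for illustration.

```python
# Toy sketch of the case-by-case scenario: the real decision rule uses a
# protected attribute, but each individual recipient is shown a simple
# one-feature justification that ignores it yet matches their outcome.
# Rule family, features, and data are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(1)
n = 200
income = rng.normal(1.0, 0.5, n)
debt = rng.normal(0.5, 0.3, n)
group = rng.integers(0, 2, n)  # protected attribute

# Intentionally discriminatory decision: group 1 faces a harsher cutoff.
accepted = (income - debt) > np.where(group == 1, 0.9, 0.3)

def justify(i):
    """Return a threshold rule on a non-protected feature that agrees
    with the decision on case i (fit to this one case only)."""
    # Place the threshold just on the right side of this case's value,
    # so the rule trivially "verifies" whatever was decided.
    thr = income[i] - 0.01 if accepted[i] else income[i] + 0.01
    verdict = "accepted" if accepted[i] else "rejected"
    side = "above" if accepted[i] else "below"
    return f"case {i}: {verdict} because income is {side} {thr:.2f}"

# Seen in isolation, every recipient gets a plausible-looking story.
for i in range(3):
    print(justify(i))
```

Because the threshold is refit per case, a verifiable-looking justification always exists; the contradictions the paper studies only surface system-wide, when recipients can compare thresholds across cases.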
  4. Many large-scale recommender systems consist of two stages: the first stage efficiently screens the complete pool of items for a small subset of promising candidates, from which the second-stage model curates the final recommendations. In this paper, we investigate how to ensure group fairness to the items in this two-stage architecture. In particular, we find that existing first-stage recommenders might select an irrecoverably unfair set of candidates, leaving the second-stage recommender no hope of delivering fair recommendations. To address this, and motivated by recent advances in uncertainty quantification, we propose two threshold-policy selection rules that provide distribution-free, finite-sample guarantees on fairness in first-stage recommenders. More concretely, given any relevance model of queries and items, and a pointwise lower confidence bound on the expected number of relevant items for each threshold policy, the two rules find near-optimal sets of candidates that, in expectation, contain enough relevant items from each group. To instantiate the rules, we demonstrate how to derive such confidence bounds from potentially partial and biased user feedback data, which are abundant in many large-scale recommender systems. In addition, we provide both finite-sample and asymptotic analyses of how close the two threshold-selection rules come to the optimal thresholds. Beyond this theoretical analysis, we show empirically that the two rules consistently select enough relevant items from each group while minimizing the size of the candidate set across a wide range of settings. A rough sketch of the threshold-selection mechanism follows this entry.
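As a rough illustration of the mechanism, the sketch below estimates each group's number of relevant items above a score threshold from logged feedback, subtracts a Hoeffding-style confidence radius, and picks the most selective threshold whose lower bound still meets every group's requirement. The bound, the synthetic data, and the requirement levels are assumptions for illustration; the paper's actual rules and guarantees differ in their details.

```python
# Sketch of first-stage candidate selection with a per-group lower
# confidence bound (LCB): choose the most selective score threshold whose
# LCB on the expected number of relevant items still meets each group's
# requirement. Bound, data, and requirements are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(2)
n_items = 1000
scores = rng.uniform(size=n_items)             # first-stage relevance scores
groups = rng.integers(0, 2, n_items)           # item group membership
relevant = rng.uniform(size=n_items) < scores  # unobserved ground truth
# Logged binary feedback standing in for (possibly biased) interaction data.
feedback = (rng.uniform(size=n_items) < scores).astype(float)

required = {0: 20.0, 1: 20.0}  # required expected relevant items per group
delta = 0.05                   # failure probability of the bound

def lcb_relevant(group_mask, threshold):
    """Hoeffding-style LCB on the number of relevant items this group
    contributes to the candidate set at the given threshold."""
    sel = group_mask & (scores >= threshold)
    n = int(sel.sum())
    if n == 0:
        return 0.0
    radius = np.sqrt(np.log(1.0 / delta) / (2.0 * n))
    return n * max(feedback[sel].mean() - radius, 0.0)

# Scan thresholds from strict to lenient; stop at the first one (i.e. the
# smallest candidate set) where every group's LCB clears its requirement.
chosen = 0.0
for threshold in np.linspace(0.95, 0.0, 96):
    if all(lcb_relevant(groups == g, threshold) >= req
           for g, req in required.items()):
        chosen = threshold
        break

candidates = scores >= chosen
print(f"threshold {chosen:.2f}, candidate set size {int(candidates.sum())}")
for g in (0, 1):
    n_rel = int(relevant[candidates & (groups == g)].sum())
    print(f"group {g}: {n_rel} truly relevant items in the candidate set")
```

The final loop checks the chosen set against the synthetic ground truth, mirroring the paper's empirical question of whether the candidate set contains enough relevant items from each group while staying small.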